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ABSTRACT 



Since mathematical English (ME) differs from 
ordinary English (OE) in the number of symbols used , this research 
investigated sequential constraint (constraints on symbol choice 
attributed to preceding textual material) of excerpts from 18 
mathematics books, both traditional and modern, to determine its 
relationship to readability. Findings indicated the following: (1) 

the length of the total passage must be considered; (2) sequential 
constraint did not differ for modern and traditional mathematics 
books; (3) sequential constraint varied between topics, which implies 
that no value of sequential constraint can be assigned to ME; (4) an 
inverse relationship existed between sequential constraint and grade 
level; (5) there was more constraint in the deductive style of 
writing; and (6) there was an inverse relationship between sequential 
constraint and reading comprehension of ME. Implications for teaching 
would place greater emphasis on topics having high constraint since 
such topics were associated with lower reading scores, and topics 
having low constraint might be developed in greater depth since they 
were associated with higher reading comprehension scores. Included 
are a bibliography, a list of definitions of technical terms, and 
tables. (DH) 
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UJ INTRODUCTION 

Human behavioral study can be thought of as an investigation of 
sequences of chain reactions. Humans react to the behavior of their 



predecessors and in turn Influence others. Language is one important 
aspect of human behavior which is also Important in the educational process. 
For instance, Carroll (1968) wrote* 

By far the largest amount of teaching 
aotivity in educational settings involves 
telling things to students, whether orally 
or in print. Traditional Instruction 
characteristically uses the lecture method, 
along with plentiful reading assignments* 

Even in more "progressive" educational 
settings which avoid the lecture method, 
much of the teacher's activity consists of 
A9klng questions and imparting information 
verbally, ... We expeot our students to 
learn most things by being told about them 
(p. 1). 

A textbook is a conventional instructional aid which utilizes lanuage. 

This medium is probably the most widespr ad learning device In the American 
educational system. 1 






In contemporary mathematics textbooks the topics discursed and the 
related pedagogy are somewhat different than they were a decade ago. 

(X) These changes are attributable in large part to the efforts of numerous 
mathematics curriculum groups. Among these are the School Mathematics 
CQ Study Group (SMSO), University of Illinois Committee on School Math- 
hematics (U1CSM), Madison Project, University of Maryland Mathematics 
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Throughout the paper the reader will encounter some unfamiliar words. 
Formal definitions of these technical terms appear after the references. 
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Project (UMMaP), Greater Cleveland Mathematics Program (GCMP), and the 
Commission on Mathematics of the College Entrance Examination Board* 
Diverse topics such as sets, numeration systems, descriptive statistics, 
logic, and probability are presented by at least one of the groups whereas 
a decade ago few of these topics were even mentioned In our schools. 
Different methods and emphases In teaching mathematics ore also evident. 
For example, In recent years discovery learning has appeared In varying 
degrees In contemporary mathematics programs and attention is given to 
structure, precision of language, and the spiral approach to curriculum 
development. 

The sucess of the materials written by the proponents of curriculum 

reform should be measured In large part by student performance. A 

contributing factor to student performance Is the readability of textual 

materials* Mathematical English (ME) differs from ordinary English (OE) 

In many aspects, some of which may affect readability* For example, 

ME Includes not only the 26 symbols of the alphabet, punctuation marks, 

end of sentence, and space but also numerals, operation signs, and other 

specialized symbols. In addition, a tremendous compacting of Information 

n 

occurs throughout ME by the use of speolal syral Is such as £ p(l) log j 
b 1=1 

/ f(x)dx, etcetera. Moreover, In OE many nouns ore rich In connotatloi 
a 

whereas In ME nouns that name mathematical objects generally have a single 
denotatum (at least for a given author). Any extraneous meaning such 
nouns may evoke is "noise." 

Differences within MB Itself occur because of the appearance of 

diverse topics and the concomitant symbolism. For example, elementary 

b 

school mathematics generally does not Include >; , Ax, / , etcetera, 

a 

symbols which occur frequently In higher grades. Moreover, even math- 
ematics textbooks written for the same grade level have different 
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symbols* Contemporary programs used set notation which rarely appeared 
a decade ago# Close examination of a mathematics book reveals dlf erences 
In writing within the book#. Sections devoted to motivational materials 
seem to more closely approximate OE than do those concerned with proofs* 
Perhaps these Illustrations demonstrate that the characteristics of ME 
and OE are riot the same and therefore may require different reading 
skills to attain acceptable levels of reading comprehension. 

The readability of OE has been subjected to experimentation for 
many years# Consequently, some factors which can be utilized In predicting 
the readability of OE hqve been Isolated# Among these are average 
sentence length; average word length, number of familiar words In a 
passage, and the number of syllables per 100 words# Generally a combination 
of some of these factors Is used In a multiple regression equation for 
predicting the readability of OE passages# However, Chall (1958# P« 202) 
and Kano (196?) suggest that these equations may not be applicable to 
the prediction problem In mathematics# 

An approach to the readability problem In ME should attempt t< 

Isolate variables which are related to reading comprehension. Such pre- 
dictor variables ml^ht be the syntactical complexity, cloze scores 
(the number of correct Insertions made Into the blanks obtained by 
deleting every Kth word of a passage, where every K Is a positive Integer), 
proportion of non-English symbols, vocabulary familiarity (the number of • 
words in a pansage that are on a list of familiar words), and sequential 
constraint# The current research was directed toward Investigating 
sequential constraint# More specifically, the purpose of this study 
was to analyze and compare the sequential constraint of excerpts from 
mathematics books and determine the relationship between sequential 
constraint and readability# 
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Samples of writings from 18 mathematics textbooks were analyzed in 
this study. Half of the 18 textbooks used a traditional and half used a 
modern approach to mathematics. A listing of the books at each of three 
divisions (elementary, Junior high, and senior high) Is given In Tables 
1 , 2 » and 3» 

In addition, five passages of diverse mathematical content were 
utilized. Two passages Involved matrices, one logic, one the metric system, 
and one statistics. 

The pattern of language evolvement In textual material .'.s sequential 
since an order of perception is established. In reading the reader 
progresses f^om left to right and, moreover, prior context influences 
the future appearance of letters. Thus the occurrence of a letter may 
depend not only on Immediately adjacent letters, but constraints may 
extend over much of the prior context. A tool for measuring these 
constraints must therefore use the probabilistic statements Inherent In 
evolving sequential data. 

An appropriate mathematical model for analyzing date of a sequential 
nature is a Markov chain with a discrete time parameter (Binder & Wolln, 
1964). A characteristic of Markov chains, making them especially appropriate 
for application to the entropy concept of Information theory, Is that 
the probability of occurrence of a state is contingent upon only the 
immediately preceding state and none before that. For example, the assumpt ^ 
1 8 made that the probability of occurrence of a speolflc letter (jtate) 
depends only on the immediately preceding, say 20, letters and none before 
those (it 1 8 Irrelevant to the prediction problem whether the probability 
of a specific letter or the probability of the state which is Induced by 
the letter is determined). Assuming, for this example, a 28-letter 
alphabet (26 letters, end of sentence, and space) there would be (28) 




states with an accompanying probability value attached to each. The 
probability values when substituted in the appropriate formulas from 
Information theory give an estimate of the 21-gram entropy. Since the 
determination of entropy is a limiting process, the above procedure would 
be repeated for dependencies extending over spans greater than the 20 
preceding letters* 

While it would be desirable to successively approximate the entropy 
by using longer and longer sequences for predicting the occurrence of a 
letter, the problem becomes insurmountable quickly. For example, with 
prediction depending on only the Immediately two preceding letters 
there are (28) possible states (assuming a 28 letter alphabet). 

Tabulation of the frequency of occurrence of each of these digrams is 
possible (Shannon, 1951) and the resulting estimate of entropy follows 
readily. However to estimate the entropy from sequences of the 10 preceding 
letters Is quite another /iatter. In this case there are (28)*® states 
and in order to estimate the entropy the length of the English passage 
from which the frequency distribution arouse would be prohibitively long* 

These examples illustrate the need for an alternate approach to 
entropy determination, Newman and Gerstman (1952) proposed the coefficient 
of constraint as a device for estimating the entropy. An adaptation of 
the latter met' od was used to obtain estimates of sequential constraint 
In this study. 

A limitation of the present study Is that all art work, tables, and 

« 

figures were eliminated from the textual material In which they were 
embedded since in most cases it was impossible to decide on symbol order- 
ing, Certainly one would expect constraints In textual inaterlol to 
originate from such configurations but order of perception could not 
be determined. For exemple, if a parabolic curve showing various re- 
lationships appeared, no readily attainable agreement on what or how to 



analyze sequentially the Information contained therein was obvious. 

RESEARCH QUESTIONS 

This research Is directed toward the elaboration of the sequential 
constraints In ME and the relation of these constraints to reading 
comprehension. Smaller samples of the textual material are often used to 
estimate the sequential constraint for entire books. Typically a 
researcher will sample a continuous subset of characters, calculate the 
values of the sequential constraint, and then extrapolate to the materials 
In which the sample Is embedded. The proximity of Informational values 
determined from samples to the corresponding values found In entire text- 
books or chapters IK a question which Is investigated In the present 
research. 

When examining the constraint existent in ME a researcher is concernec 
with any variation in that quantity which may occur. It Is therefore 
appropriate to determine what differences exist in sequential constraint 
for modern and traditional mathematics books, within a specified 
mathematics book, between mathematics books at different gride levels, 
and for different types of discourse. 

In summary, this study was designed to answer the following questions 
1 , Is there an optimal sample Ql2e, In symbols, which may be used to 
compute a measure of sequential constraint? 

2. Is there a difference In the sequential constraint of modern and 
traditional mathematics textbooks? 

3, Is there a difference In sequential constraint for topics within 
a single mathematics textbook? 

Is thei-e a difference In sequential constraint among mathematics 
textbooks written for different grade levels? 

5. Is there a difference In sequential constraint between deductive and 
nor.-deduotl ve mathematics textual materials? 
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6 o What is the relationship between sequential constraint and reading 
comprehension of mathematics passages? 

PROCEDURE 

Excerpts from 18 mathematics books and five mathematical passages 
were used in tills study# With the exception of art work, figures, and 
tables, all characters which appeared In any selection were keypunched 
on cards for analysis on a CDC 6500 computer# The coding represented 
not only the usual 26 letters of the alphabet but also a symbol for the 
end of a sentence, space, =, + , and the other symbols that commonly 
occur in mathematics textual materials# Internal punctuation within 
ordinary English sentences was disregarded as were page numbers# 

Chapter titles were included# Conjunctions and hyphenated words were 
treated as single words, for example, 'let's' would be punched In four 
consecutive columns with the apostrophe omitted and 'non-zero' would 
be treated as one word and keypunched Jn seven consecutive columns# 
Textual material was placed on cards in the order that It would be read# 
To illustrate, $ Is read one-fourth so the first symbol to be placed 
on the card Is 1, then a symbol for the vinculum, and finally the *<« 

After each end-stop symbol, that Is a period, exclamation mark, question 
mark, or colon, a space was allowed# No spaces were keypunched within 
any equation# Decisions leading to the above rules depended on the order 
In which textual material is rend by a render and previous studies 
(Newmnn A Gerstman, 1952 i Newman A Wau^h, 19601 Paisley, 1966) on 
redundancy for 0E# 

2 

The Computer Program 

The computer program was developed In two sections, Programs A and ii« 
Program A encoded textual material Into machine characters# In Program 

- 7 — ■ — 

The researcher Is Indebted to Robert Crlpe for his assistance 
In the development of the computer program. 
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B contingency tables were constructed showing the frequency with which 
each character followed every other character Immediately and at distances 
of 2, 3i •«*» 15* All computations were done In Program B* The total 
program is quite flexible and can, with minor alterations, accommodate 
up to 126 distinct characters, In addition contingency tables can be 
constructed for symbols separated by a maximum of il9 intervening characters, 

ANALYSIS 

Shannon (19^8) defined (relative) redundancy as one minus the 
relative entropy where the relative entropy is the ratio of the entropy 
to the largest value It could have while still restricted to the same 
symbols, Thus the relative redundancy (R) is given by the formula* 

» - i - i U) 

nom 

where the entropy H = lim P N , = - ^p(l) Pjj logg p ij* P(*) 18 
probability of the (N-l) -gram 1, p^j is the probability of the symbol 
J given the (N-l) -gram 1, and the nominal value of the entropy H nom « logg n 
where n Is the number of distinct symbols. As mentioned previously, 

If statistical effects extend over N-grams, as N becomes large the 
calculation of the entropy is not possible* 

Garner and Carson (I960) separated redundancy Into two parts, the 
distributional constraint and the sequential constraint* For example, 
the model for redundancy when only the N-l preceding variables (each 
letter position is a variable) are considered 1st 

H n 0n - p N * < H no» - *W + < M »a* ' V' ! 2 J 

In Equation ? »H n0rn Is defined as before, H gives the uncertainty 
when the symbols are Independent but not necessarily equally probable and 
is defined as ^ p(l) logg P(D» p(l) Is the probability of the 

symbol 1, and F^ Is the N-gram entropy. The left hand side of Equation 
2 approximates the numerator of the Shannon formula for redundancy and, 

In fact, would be identical if statistical effects did not extend over 
O 
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sequences longer than N~1 symbols In length, that Is, If F^j = H© The 
expression within tho first parenthesis on the right hand side of Equation 
2 Is the distributional constraint© This expression gives the reduction 
in uncertainty attributable to unequal frequency of occurrence of symbols© 
To Illustrate, If there were no syntax or spelling rules and If all 
letters occurred with almost the same frequency, then the reduction in 
uncertainty would be minuscule and prediction of any letter In a 
sequence would be little better than chance© The expression within the 
second parenthesis on the right hand side of Equation 2 Is the sequential 
constraint which gives the reduction in uncertainty due to statistical 
effects extending over sequences of length N-l© 

The distributional constraint is, of course, easily obtainable and 
Is Independent of problems Inherent In working with sequential depend- 
encies© However, calculation of a value for the sequential constraint 
Is more difficult since the N-gram entropy term appears. Binder and 
Wolln (1964) proved that the sequential constraint Is equal to the 
multiple contingent uncertainty. Consequently, the problem of determining, 
the sequential constraint reduces to finding the multiple contingent 
uncertainty, 

A technique suggested by Newman and Gerstman (1952) ha« been 
adapted In the current research to the problem of estimating the multiple 
contingent uncertainty© The procedure consists of calculating the simple 
contingent uncertainties and summing those to estimate the multiple 
contingent uncertainty© The specific formulas utilized In tills study to 
estimate the multiple contingent uncertainties and subsequently the 
relative sequential constraints from the summation of simple contingencies 
follow© 

An estimate C n of the relative sequential constraint for sequence 
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of length n Is given byi 

n 

£ 

C„ = k=2 H(1 iK) ' VI 

n " hTiT 

In formula 3, H(liK) = H(l) - H K (1), where H(1 ) = - £ p(i) log 2 p(l) # 

the summation being over the entire alphabet, and H K (1) is the uncertainty 

of the letter being predicted when only the (K-l)th preceding letter 

is used in the prediction,, Formally, H^(l) = -£ p(l) Pjj log p^, 

where p^ is the probability of symbol J given that i occurred k-1 letters 

before it. The indices on this summation sign range over the entire 

alphabet being conslderedc The expression appearing in the numerator, 
n 

> H(lrK), is the summation of the simple contingencies, H(l) - H„(l), 
k=2 K 

culminating in an estimate of the multiple contingent uncertainty when 

only the N-l preceding variables in the Markov chain are utilized* 

Several researchers have used a formula commonly referred to as the 

Mlller-Madow formula and presented by Miller (195 5) for correcting sample 

bias of certain informational functionals computed from sequential data. 

Among the researchers are Newman and Waugh (I960) and Paisley (1966), 

The correctional formula utilized by Paisley is well disguised. As 

pointed out by Garner (19&2) this correctional formula is suitable when 

the underlying populations are either univariate or bivariate but seems 

less appropriate with overlapping data where the source is clearly 

Markovian (Binder & Wolin, 196^)* 

RESULTS 

An adaptation of the Newman-Gerstman (1952) method was employed in 
the present study to estimate all values of the relative sequential 
constraint for ME, Successive values of the simple contingencies through 
16 were summed to approximate the relative sequential constraint. In 
most Instances simple contingencies beyond 16 characters were constant. 
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Proximity of Sampl e Estimates of l 6 -Letter Belatlve 
Sequential Constraint to the Mathematical 
Language In Whloh the Samples are Embedde d 
Estimates of relative sequential constraint calculated from samples 
will often vary from those of the larger textual materials in which the 
samples are embedded* The nature of this variability was Investigated. 

One aspect of the problem was to determine how representative a selected 
sample Is of other equal length samples from the same textbook* For 
example, if a 5,000 symbol random sample of continuous textual material 
leads to a certain value of the relative sequential constraint, how 
representative of other 5*000 symbol random samples is that sampled 

Five random samples each of 5*000, 20,000, 25*000, 30,000, 35*000, 
and 40,000 characters were Selected from each of two mathematics textbooks ts 
answer this question. The estimates of l 6 -letter relative sequential 
constraint for the samples from Learning Mathematics (Deans, Kane, 

McMeen & Oesterle, 1968 ) and Exploring Elementary Algebra (Keedy, Jameson, 
Johnson & Ciechon, 196?) are tabulated in Tables 4 and 5o The variance 
for each sample size from each selection is reported in Table 6 . The 
figures Imply that representativeness Is closely allied to sample size. 

For example, In L earning Mathematics and Exploring Elementary Algebra 
variability among the five 5*000 symbol samples was 0,173 und 0.095 
while for the 40,000 symbol samples variability was 0.003 and O.O 36 . The 
variability for Intermediate sample sizes decreased with Increasing sample 
size with a few exceptions. Thus In both books variability between samples 
was greater for the smaller samples and smaller with the larger sample sizes. 
Another problem in extrapolating from samples to larger selections 
of textual material was the proximity of the sample means for estimates 
of the l 6 -letter sequential constraint to the corresponding values In the 
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larger selection* The means of the estimates of 16-letter relative 
sequential constraint for the five samples from each of the six sample 
sizes together with estimates of 16 -letter relative sequential constraint 
for the total selection are presented In Table 7 . With one exception 
an increase In sample size resulted In a better estimate of relative 
sequential constraint for the total selection, Thus the mean values of 
the l 6 -letter relative sequential constraint for the five 5»000 symbol 
samples of textual material from Learning Mathematics and Exploring 
Elemen t ary Algebra were lo?l4 and 2.096 while the mean values for the 
five *<0,000 symbol samples were O.S*<8 and 1.399* The l 6 -letter relative 
sequential constraint for the total selection from each of the two books 
was 0.853 and l*35^c Therefore, the *<0,000 symbol samples yielded mean 
values of l 6 -letter relative sequential constraint which were closer to 
the corresponding entries In the total selection than were those for the 
5,000 symbol samples c For these data the proximity of the approximation 
seems to be contingent upon both the sample size and the length of the 
total selection. 

Differences In Estimates of Relative Sequential 
Constraint Be tween Modern and Traditional 
Mathematics Books 

English language depends on only 26 discrete letters together with 
a space and punctuation to convey meaningful ideas. Within mathematical 
language a larger number of symbols is utilized. Consequently, a 
mathematics book usually contains many symbols not found In ordinary 
English books. 

Alphabet ' size and estimates of the l 6 -letter relative sequential 
constraint for two 20,000 symbol random samples of mathematical language 
from different books at each of four elementary school grade levels are 
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reported In Table 8 , At each grade level the samples were selected from 
passages in which a common topic was presented. 

In three of the four comparisons presented in Table 8 alphabet 
size is greater for the modern books. However, for relative sequential 
constraint two of the comparisons indicate more restraint for modern 
language while the reverse is true for the remaining comparisons. Thus 
while Developing Mathematics , Understanding Mathematics , and Learning 
Mathematics have 55# 62, and 56 distinct symbols only the latter two 
books have greater 16 -letter relative sequential constraint, 1,656 and 
1 , 638 , than their counterparts. 

Estimates of 16-letter relative sequential constraint for excerpts 
from each of four Junior high school mathematics books are reported in 
Table 9* Alphabet size and relative sequential constraint are larger for 
modern mathematics textbooks in one of the two comparisons presented. 

The corresponding structural characteristics of mathematical 
language for six books at the senior high school level are enumerated 
in Table 10, More constraint for modern mathematics books is found in 
only one of the three comparisons given in Table 10 while a smaller 
alphabet occurs In a modern book In one of the three comparisons. Thus, 
E xploring Elementary Algebra is the only modern mathematics textbook In 
the three comparisons with more 1 6 -letter relative sequential constraint 
than its counterpart ( 1,869 versus 1,079), The only modern mathematics 
book reported in Table 10 having fewer distinct symbols than' its counter- 
part (60 versus 65 ) is Advanced High School Mathematics , 

When comparing the modern and traditional books presented in Tables 
8, 9, and 10 alphabet size Is greater for modern books in six of the nine 
comparisons but relative sequential constraint Is larger In only four of 
the nine comparisons. It also should be noted that in seven of the nine 




comparisons the selection with the smaller alphabet has the smaller 
estimate of relative sequential constraint. Alphabet size seems to be 

V 

directly related to relative sequential constraint for these data, at 
least when topic is controlled. 

Within Book D1 f ferences for Estimates of 
Relative S equentia l Constraint 

One question Investigated was whether relative sequential constraint 
varied between topics within a mathematics book. Excerpts from four 
textbooks were used to study this. The books, topics selected, alphabet 
size, and estimate of l 6 -letter relative sequential constraint are 
tabulated In Table 11, It Is apparent in all four comparisons that topic 
is related to estimates of relative sequential constraint* For example, 
the two topics, fractions and geometry, in Learning Mathemati cs had 
1,638 and 1,052 respectively as estimates of 16 -letter relative 
sequential constraint. For this book more constraints are Imposed on 
textual material when fractions are discussed than when geometry is 
presented. In addition, for three of the four comparisons there is an 
inverse relationship between alphabet size and estimate of relative 
sequential constraint. This Is In contrast to the association between 
alphabet size and estimates of relative sequential constraint noted 
earlier. 

Differences In Est lmates of Relative Sequential 
Constraint Between Grade Levels 

Some insight Into the variation In relative sequential constraint 

between 9 -rade levels can be obtained by controlling topic and authorship, 

r 

The relevant data for this aspect of the study.^ln Table 12, While a 
direct relationship between alphabet size and estimates of relative 
sequential constraint Is apparent In these data It Is also noteworthy 
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that sequential constraint decreases with increasing grade level* To 
Illustrate, Row-Peterson Arithmetic 4 Is less constrained than Row-Peterson 
Arithmetic % (I«4l0 versus 1*469) and Extending Mathematics * a book for 
eighth graders, has less sequential constraint (0*856 versus 1 . 052 ) 
than its counterpart, a book for fifth grade students. More Information 
is contained In passages that occur at higher grade levels than their 
counterparts of equal length at lower grade levels* 

Differences In Estimates of Relative Sequential Constraint 
for Two Styles of Mathematical Language 
Some of the writing found In mathematics books is deductive In 
nature* A question of Interest was whether measures of relative sequential 
constraint for deductive textual materials differ from the corresponding 
values enumerated from less directive discourse. To answer this question 
two 10, 4l4 symbol samples of textual material were selected within topics 
which contain both expository styles. Alphabet size and estimate of 
16-letter relative sequential constraint are given in Table 13* Relative 
sequential constraint and alphabet size are greater for the deductive style 
of ME investigated In this study. 

Relative Sequential Constrain t 
and Reading Comprehe ns Ion 

To ascertain the degree of association between relative sequential 
constraint and reading comprehension five passages from Hater's (1969) 
study were used. The students in Hater's study were enrolled In grades 
7 through 10 of Roman Catholic parochial schools in Cincinnati, Dayton, 
Springfield, and Lincoln Heights, ohlo. A reading comprehension test 
was given to approximately 125 randomly selected students from the above 
schools on each of the five passages, a mean was determined for the 125 
reading comprehension test scores on each passage. These five means 
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together with a measure of relative sequential constraint for each of the 
five passages are reported in Table • Reliability Indices for the 

reading comprehension tests were computed using the Kuder-Richardson 

o- 

Formula 20. For each passage the rellbillty coefficient was at least 0*78. 
The Spearman Rank Correlation Coefficient was used to obtain a measure of 
relationship between mean scores on reading comprehension tests over 
the passages and relative sequential constraint. This coefficient was 
-0*30. 

CONCLUSIONS 

The purpose of this study was to quantify and compare the sequential 
constraints (constraints on symbol choice attributable to preceding 
textual material) extant In ME passages and to ascertain the degree of 
relationship between these constraints and reading comprehension. Data 
consisted of textual materials from 18 mathematics books. In addition, 
five passages from Hater’s (1969) study were utilized. The textual 
materials were keypunched In a uniform format for processing on a CDC 6500 
computer. Analysis proceeded according to a technique Initiated by Newman 
and Oerstman (195?) « Briefly, the constraint imposed on the criterion 
variable (symbol being predicted) by each of the predictor variables 

(preceding m smybols vrhere m = 1, 2 15) was determined. These 

constraints were then summed resulting in an estimate of sequential 
constraint. 

The reader's attention was directed to the problems inherent in a 
relative frequency interpretation of probability associated with information 
theory concepts. Such difficulties confront the researcher concerned 
with applications of information theory If only finite samples are 
available. These restrictive conditions exist In language analysis. 

A perusal of the literature revealed corrective formulas when the underlying 
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models are either univariate or bivariate. However, no formula could be 
found to correct for sample bias with overlapping sequential data. Such 
data are innately characteristic of language. 

The usual procedure in analyzing language rests on the assumption 
that a continuous sample of symbols is representative of the textual 
material in which it is embedded. Thus a researcher typically selects 
a sample , computes a measure of constraint on the sample, and extrapolates 
to the larger selection containing the sample. The validity of this 
assumption for ME was examined in the present study. Five samples of 
continuous textual material of 5i000, 20,000, 25 , 000 , 30 , 000 , 35,000 and 

j 

^0,000 symbols were randomly selected from each of two passages containing 
^7» 295 and 113,097 symbols. Measures of constraint were obtained for 
each sample, and the mean and variance for each sample size within each 
passage was determined. In addition, measures of constraint on the 
^7,295 and 113,097 symbol passages were found. Implications drawn from 
these data indicate that extrapolation is enhanced with increasing length 
of passages since variability within samples and distance from total 
selection means is reduced. Moreover, the length of the total passage 
must be considered. While a sample of a specified length may be entirely 
adequate when discussing constraint for a certain length selection, it 
may not be adequate for longer selections. 

One question investigated was whether constraint differs for modern 
and traditional mathematics books. At a fixed grade level textbooks 
illustrative of each approach were chosen. The topic was controlled 
between books, and 20,000 symbol passages were randomly drawn from each 
of the two textbooks. For each passage alphabet size and constraint 
were determined. Neither modern nor traditional textbooks consistently 
had greater sequential constraint although modern books used more symbols 
In six of the nine comparisons presented. Also alphabet size was directly 
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related to relative sequential constraint, at least when topic was 
controlled, In seven of the nine comparisons. 

Another aspect of the research was to determine whether relative 
sequential constraint fluctuates between topics within a book. Passages 
containing 20,000 symbols were randomly selected from each of two topics 
for each of four textbooks. Results Indicated that sequential constraint 
varies between topics. The Implication Is that a unique value of 
constraint for ME, even within a given textbook, is nonexistent. That 
Is, no value of sequential constraint can be thought to be indicative of 
ME. 

Another question was whether constraint varies with ascending grade 
level. To answer this question 20,000 symbol passages were randomly 
selected from textbooks at different grade levels, but with topic and 
authorship controlled. Measures of constraint on these passages 
revealed an Inverse relationship between relative sequential constraint 
and grade level. Thus textual material at the third grade level was 
more constrained than that at the fourth grade level. Whether the 
increased constraint found at lower grade levels Is a help or deterrent 
to reading comprehension Is another question which will be discussed below. 

In mathematics textbooks some of the language Is concerned with 
deductive reasoning. The comparison of constraint for this language 
st.yl° and less directive discourse was also Investigated, A passage of 
each language style was selected from each of two mathematics books. 
Alphabet size and relative sequential constraint were determined for each 
of the ME passages. Results Indicated that the deductive style of 
presentation was more constrained and had a greater number of symbols than 
the less directive discourse. 

Five passages over which reading comprehension tests had been 
administered were analyzed In the final phase of the study. This phase 
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sought to Investigate the relationship between reading comprehension 
of ME and relative sequential constraint# Measures of constraint on 
each of the five passages were determined. The correlation coefficient 
Indicated an Inverse relationship between relative sequential constraint 
and reading comprehension. Thus more constrained textual material seems 
to result In lower scores on reading comprehension tests, at least for 
ME# Herein may lie a distinction with OE where a direct relationship 
exists between reading comprehension and constraint# Possibly topics 
which have low constraint associated with them might be developed to a 
greater depth since low constraint Is associated with higher scores on 
reading comprehension tests. That Is, detailed discussion of peripheral 
areas related to topics having low constraint may be beneficial# 
Furthermore, greater emphasis In teaching should be placed on those topics 
having high constraint since such topics are associated with lower reading 
comprehension scores. 
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DEFINITIONS 

1. Deductive Textual Material . Material which results when an axiomatic 
system Is applied over a sequence of steps leading a person from 
initial conditions to the conclusion, examples are proofs of theorems, 
lemmas, etcetera. 

2. Entropy (H) . The minimum average number of binary digits required 

to encode each character of textual material, formerly H = llm F M 
where F^, is the N-grara entropy: Information; uncertainty . 

3. Inf ormntlon . See entropy. 

, Letter Redundancy . Redundancy measurement In which the basic 
sampling units are letters. 

5. Markov Chain With A Discrete Time P arameter . A stochastic process 

such that P [ X. = x, ) X. = x, = x. 3 = Pfx,. = x. I 

n n ' t n-l t n-l t i c l c n c n 

= x t 1 f° r any Integer n $ 1 and any set of n time points 

n-1 n-1 

0 ^ t, im <t„ In the Index set T whore the values are discrete and T 
1 n 

Is finite. Less formally, a stochastic process which moves through 
a finite number of states, end for which the probability of entering 
a certain state depends only on the last state occupied. 

6. Mathematl cal English (ME) . The written language found in math- 
ematics textual materials. 
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7 o Multiple Contingent Uncertainty , The total amount of uncertainty 

in the criterion variable which can be predicted from simultaneous 

values of the preceding variables* 

8, N-gram Entropy * Information when the N-l preceding letters are 

used In predicting the Nth letter of a sequence N letters long, 

formally = -c’P (1) log 2 w bere P (1) Is the probability 
1 1 J 

of the (N-l) -gram 1 and P^ is the probability of the single symbol 
j when the (N-l) -gram 1 is given* 

9* Readability of a Passage » A group mean on a reading comprehension 
test over the passage. 

10, Redundancy * 1 - where H Is the entropy and H max Is the 

entropy which would result if all states were independent and equally 
probable* The redundancy Is a measure of the constraint Imposed 

on textual material due to its statistical structure, for example, 

In English the tendency of H to follow T* 

11, Relative Sequential Constraint » A measure of redundancy computed 
from the summation of contingent uncertainties. 

12, Simple Contingent Uncertainty , A measure of the amount of uncertainty 
reduction due to the contingencies between the initial predictor 
variable and tbe criterion variable* 

13» Single Letter Uncertainty (H( 1 ) ) * Uncertainty when each letter Is 

Independent of every other, formally H(l) = -£P(1) log,, P (1) where 

1 

P(i) Is the probability of the letter 1* 

Uj, State * Some specific set of values of all the variables of concern, 
15* Stochastic Process , An arbitrary family of realvalued random 

variables j X^. f t In Tj where T is the set of all possible times 
and the possible values (states) of at time t are x fc » 

16. Textbooks Which Illustrate ^ Modern Approach To Mathematics 

Education * Mathematics books whose most recent copyright date 
1 8 at least 1963 . 
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17* Textbooks Which Embody A Traditional Approach To Mathematics 

Education * Mathematics books whose most recent copyright date 
Is 1962 or before 0 ' 

18. Uncertainty. See entropy. 




TABLE 1 



ELEMENTARY SCHOOL MATHEMATICS TEXTBOOKS 
USED FOR THIS STUDY 



Textbook 


Publisher 


Grade 

Level 


Copyright 

Date 


Developing Mathematics 


American 

Book 

Company 


3 • 


1968 


Row-Peterson Arithmetic 


Row, 

Peterson and 
Company 


3 


1959 


Understanding Mathematics 


American 

Book 

Company 


k 


1968 


Row-Peterson Arithmetic 


Row, 

Peterson and 
Company 


k 


1959 


Learning Mathematics 


American 

Book 

Company 


< 

✓ 


1968 


Row-Peterson Arithmetic 


Row, 

Potorson and 
Company 


5 


1959 


Unifying Mathematics 


American 

Book 

Company 


6 


1960 


Row-Peterson Arithmetic 


Row, 

Peterson and 
Company 


6 


1959 
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TABLE 2 



JUNIOR HIGH SCHOOL MATHEMATICS TEXTBOOKS 
USED FOR THIS STUDY 



Textbook 


Publisher 


Grade 

Level 


Copyright 

Date 


Structuring Mathematics 


American 

Book 

Company 


7 


1968 


Rovi-Peterson Arithmetic 


Row, 

Peterson and 
Company 


7 


1959 


Extending Mathematics 


American 

Book 

Company 


8 


1968 


Row-Peterson Arithmetic 


Row, 

Peterson and 
Company 


8 


1959 



TABLE 3 

SENIOR HIGH SCHOOL MATHEMATICS TEXTBOOK 
USED FOR THIS STUDY 





Textbook 


Publisher 


Grade 

Level 


Copyright 

Date 



Exploring Elementary 
Algebra 


Holt, 

Rinehart and 
Winston, Inc, 


9 


1967 


Algebra - Book One 


Ginn and 
Company 


9 


I960 


Exploring Geometry 


Holt, 

Rinehart and 
Winston, Inc. 


10 


1967 


A First Course In 
Geometry 


The L. W. 
Singer Company 


10 


1959 


Advanced High School 
Mathemntlcd 


Charles E. 
Merrill Books, 
Inc. 


12 


1965 


Elementary Mathematical 
Analysis 


D. C. Heath 
and Company 


12 


1962 



O 

ERIC 



TABLE *4 



FIVE ESTIMATES OF 16-LETTER RELATIVE SEQUENTIAL 
CONSTRAINT FOR EACH OF SIX SAMPLE SIZES* 



5,000 


Sample Size (In Characters) 
20,000 25,000 30,000 35.000 


*4 0,000 


1.780 


1.088 


0.932 


0.902 


0.8*42 


0,80? 


1.133 


1 .06/4 


0.779 


0.900 


0.8*49 


0,802 


2.052 


0.858 


0.952 


0.90*4 


0.8*48 


0.908 


1.360 


1.179 


0.955 


0.886 


0.962 


0.910 


2 . 0/46 


I.069 


0.980 


0.899 


0.959 


0.811 



*The samples were embedded within Chapters 1 and ? of 
Learning Mathematics . These chapters consisted In total of 
*47, 295 characters . 

TABLE 5 

FIVE ESTIMATES OF 16-LETTER RELATIVE SEQUENTIAL 
CONSTRAINT FOR E JH OF SIX SAMPLE SIZES* 



Sample Size (In Characters) 



5,000 


20,000 


25,000 


30,000 


35,ooo 


*40,000 


2.315 


1 .880 


1 .3*4 3 


I.816 


1.798 


1.6*49 


2.0?8 


2.0*43 


1.101 


1 .89*4 


1.7*4 7 


1.518 


2.385 


1 .822 


1.337 


1 .8*49 


1 .*4 1*4 


1.200 


1 .59? 


1.757 


1.7*40 


1.266 


1.339 


1.233 


2.106 


1. 8*4*4 


1 .192 


1.232 


1.262 


1.397 



♦The samples were embedded within Chapters *1 and 5 of 
Exploring Elementary Algebra . These chapters consisted in 
total of 1*3,097 characters. 



TABLE 6 



VARIANCE FOR ESTIMATES OF 1 6 -LETTER RELATIVE 
SEQUENTIAL CONSTRAINT OF SAMPLES FROM 
TWO MATHEMATICS BOOKS 



Sample 


Learning 

Mathematics 


Exploring Elementary 
Algebra 


5,000 


0.173 


0.095 


20,000 


0 . 01*4 


0.012 


25,000 


0.007 


0,060 


30,000 


0.000 


0.111 


35,000 


0 <, 00*4 


0,060 


'(0,000 


0.003 


0.036 



TABLE ? 

PROXIMITY OF SAMPLE* AND TOTAL SELECTION** ESTIMATES 
OF 16 -LETTER RELATIVE SEQUENTIAL CONSTRAINT 





Sample 

Size 


Learning 

Mathematics 


Exploring Elementary 
Algebra 



5,000 


1 . 71 ** 


2.096 


20,000 


1.052 


1.869 


25,000 


0.920 


1 . 3**3 


30,000 


0.898 


1.611 


35,000 


0.892 


1.512 


* 40,000 


0 . 8*48 


1.399 


Total Selection 


0.853 


1 * 35 ** 



•Sample values reported represent the menus of five 
random snmples of the Indicated size* Values for the random 
samples ere reported In Tableg *j and 5 * 

**In Learning Mathematics the total selection consisted 
of oil of Chapter l and ?, a total of * 4?, 295 characters 
•vhlle In Explorl ng Elementary Alee bra Chapters *4 and 5 
consisting of 113*097 characters were used* 



TABLE 8 



ESTIMATES OF 1 6 -LETTER RELATIVE SEQUENTIAL CONSTRAINT 
FOR 20,000 CHARACTER RANDOM SAMPLES OF 
CONTINUOUS TEXTUAL MATERIAL FROM MODERN AND 
TRADITIONAL ELEMENTARY SCHOOL MATHEMATICS BOOKS 



Alphabet Estimates of l6~Letter 
Book Size Relative Sequential 

Constraint 



Developing 

Mathematics 


55 


1.009 


Row-Peterson 
Arithmetic 3 


51 


1.1^9 


Understanding 

Mathematics 


62 


1.656 


Row-Peterson 
Arithmetic U 


52 


1J110 


Learning 

Mathematics 


56 


1.638 


Row-Petereon 
Arithmetic 5 


51 


1.189 


Unifying 

Mathematics 


1j6 


0.?21 


Row-Peterson 
Arithmetic & 


^8 


0.8*J? 



TABLE 9 

ESTIMATES OF 16-LETTER RELATIVE SEQUENTIAL 
CONSTRAINT FOR 20,000 CHARACTER RANDOM SAMPLES 
OF CONTINUOUS TEXTUAL MATERIAL FROM MODERN 
AND TRADITIONAL JUNIOR HIGH SCHOOL 
MATHEMATICS BOOKS 



Alphabet Estimates of 16-Letter 
Book Size Relative Sequential 

Constraint 



Structuring 

Mathematics 


55 


1.^91 


Rox-Peterson 


53 


1 -339 


Arithmetic 7 


Extend ing 
Mathematics 


56 


O.856 


Row-Peterson 
Arithmetic 8 


57 


1.011 
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TABLK10 



ESTIMATES OF If -LETTER RELATIVE SEQUENTIAL CONSTRAINT 
FOR 20,000 CHARACTER RANDOM SAMPLE'S OF 
CONTINUOUS TEXTUAL MATERIAL FROM MODERN AND TRADITIONAL 
SENIOR HI OH SCHOOL MATHEMATICS ROOKS 



Alphabet Estimates of 16-Letter 



Book 


Size 


Relative Sequential 
Constraint 


Exploring 
Element ary 
Algebra 


61 


1.869* 


Algebra 
Book One 


50 


1.079 


Exploring 

Geometry 


67 


0.977 


A First Course 
In Geometry 


57 


0.995 


Advanced 
High School 
Mathematics 


60 


1.219 


Elementary 
Math emat.lt cal 
Analysis 


65 


1.22U 



{♦Represents the mean of the flvo 20,000 character 
random samples reported in Table 5 
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TABLE 11 



WITHIN BOOK DIFFERENCES IN ALPHABET SIZE AND 
ESTIMATE OF 1 6 -LETTER RELATIVE SEQUENTIAL 
CONSTRAINT FOR 20,000 SYMbOL RANDOM SAMPLES 
OF CONTINUOUS TEXTUAL MATERIAL 



Book Topic Alphabet Estimate of l 6 -Letter 

Size Relative Sequential 
Constraint 



Learning 


Fractions 


56 


1.638 


Mathematics 


Geometry 


6l 


1.052* 


Advanced 
High School 


Trigonometry and 
Probability** 


60 


1.219 


Mathematics 


Matrices, Vectors 
and Limits** 


63 


1.179 


Row-Peterson 


Measurement 


51 


1.1^9 


Arithmetic 3 


Basic Operations 


53 


1 . 1*69 


Row-Peterson 


Geometry 


57 


1.011 


Arithmetic 5 


Fractions and 
Decimals** 


5 ^ 


1.812 



•Represents the mean of the five 20,000 character random 
samples reported In Table ^ . 



**In this study these toolcs composed the same sample, 

TABLE 12 

ALPHABET SIZE AND ESTIMATE OF 16 -LETTER RELATIVE 
SEQUENTIAL CONSTRAINT BETWEEN BOOKS AT DIFFERENT 
GRADE LEVELS WITH TOPIC CONTROLLED 



, 




Alphabet 


Estimates of 1 6-Letter 


Book 


Topic 


Size 


Relative Sequential 
Constraint 



Row-Peterson 
Arithmetic j 


Basic 

Operations 


53 


1 .1169 


Row-Pet arson 
A 1 *! thmotlc 


lVa sic 

Operations 


52 


1 .MO 


learning 

Mathematics 


Geometry 


6l 


1 . 052 * 


Extend 1 nr. 
Mathematics 


Geometry 


56 


O .856 



% 



•Represents the mean of the five 20,000 character 
random samples reported In Table lj. 



TABLE 13 



ALPHABET SIZE AND ESTIMATES OF RELATIVE SEQUENTIAL 
CONSTRAINT FOR TWO STYLES OF MATHEMATICAL LANGUAGE 



Style Alphabet Size Estimate of l6-Letter 

Relative Sequential 
Constraint 



Deductive 



61 



1.294 



Non-Deduct lve 



56 



1.124 



TABLE 14 

MEAN* AND ESTIMATES OF 1 6 -LETTER RELATIVE SEQUENTIAL 
CONSTRAINT FOR FIVE MATHEMATICS PASSAGES 



Passage 


Estimate of l6~Letter 
Relative Sequential 
Constraint 


Mean* 


Matrices 




2.669 


1?.0??4 


Metric System 




1.579 


12.9129 


Studying Matrices 




2.565 


12.5721 


Statistics 




3.004 


10.7994 


Loglo 




2.189 


14.3951 


*Mean of at proximo 
co.rprehensl on test over 


trly 

the 


125 students on a 
passage . 


read 1 ng 
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